SQLSAM: SQL for statistical analysis and modeling

نویسندگان

  • Joobin Choobineh
  • Anil Kini
چکیده

Abstrait Statistical modeling and analysis is extensively used in businesses for various purposes including graphic visualization of data, measurement of central tendencies and other statistics, and inferences on populations based on samples. Data are the fundamental component of each of these activities. In this paper an extension of the standard database language SQL for statistical modeling and analysis is presented. Models covered include descriptive analytic and graphic measures, discrete probability distributions, continuous probability distributions, inferential statistics, and regression analysis. Through the use of SQLSAM, seamless integration of existing data in organizations’ databases and their statistical analysis can be achieved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Splash: Integrated Ad-Hoc Querying of Data and Statistical Models

This paper presents a system called Splash, which integrates statistical modeling and SQL for the purpose of adhoc querying and analysis. Splash supports a novel, simple, and practical abstraction of statistical modeling as an aggregate function, which in turn provides for natural integration with standard SQL queries and a relational DBMS. In addition, we introduce and implement a novel repres...

متن کامل

What is the Population of Interest: Population Modeling for BayesDB

BayesDB [1, 2] is a probabilistic programming platform that enables users to solve probabilistic data analysis problems using a simple, SQL-like language. Queries execute against generative population models (GPMs), a new abstraction that can be used to integrate data, metadata, qualitative domain knowledge, and quantitative models. Baseline quantitative models are typically built via an AI mod...

متن کامل

Modeling of Banks ‌Bankruptcy in Iran (Multivariate Statistical Analysis)

In this paper we construct a modeling for detection of banks which are experiencing serious problems. Sample and variable set of the study contains 30 banks of Iran during 2006-2014 and their financial ratios. Well known multivariate statistical technique (principal component analysis) was used to explore the basic financial characteristics of the banks, and discriminant Logit and Probit models ...

متن کامل

BayesDB: A probabilistic programming system for querying the probable implications of data

Is it possible to make statistical inference broadly accessible to non-statisticians without sacrificing mathematical rigor or inference quality? This paper describes BayesDB, a probabilistic programming platform that aims to enable users to query the probable implications of their data as directly as SQL databases enable them to query the data itself. This paper focuses on four aspects of Baye...

متن کامل

Statistical Shape Modeling of Musculoskeletal Structures and Its Applications

Statistical shape models (SSM) describe the shape variability contained in a given population. They are able to describe large populations of complex shapes with few degrees of freedom. This makes them a useful tool for a variety of tasks that arise in computer-aided medicine. In this chapter we are going to explain the basic methodology of SSMs and present a variety of examples, where SSM has ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995